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Abstract. An arbitrary homomorphism between groups is nonincreasing for 
stable commutator length, and there are infinitely many (injective) homomor- 
phisms between free groups which strictly decrease the stable commutator 
length of some elements. However, we show in this paper that a random 
homomorphism between free groups is almost surely an isometry for stable 
commutator length for every element; in particular, the unit ball in the scl 
norm of a free group admits an enormous number of exotic isometries. 

Using similar methods, we show that a random fatgraph in a free group 
is extremal (i.e. is an absolute minimizer for relative Gromov norm) for its 
boundary; this implies, for instance, that a random element of a free group 
with commutator length at most n has commutator length exactly n and stable 
commutator length exactly n — 1/2. Our methods also let us construct explicit 
(and computable) quasimorphisms which certify these facts. 



1. Introduction 

1.1. Stable commutator length. If G is a group, the commutator length c\(g) 
of an element g £ G' is the least number of commutators in G whose product is g, 
and the stable commutator length is the limit lim„_j. 00 cl(<?")/n. Stable commutator 
length scl extends to a pseudo-norm on the space B\{G) of formal real (group) 
1-boundaries, and descends to a further quotient Bf (G) := Bx(G)/(g n — ng,g — 
hgh^ 1 ), reflecting the fact that scl is homogeneous (by definition), and a class 
function. When G is hyperbolic, scl is a norm on By (G) ([S], Thm. A'). The 
crucial properties of this pseudo-norm in general are 

(1) (characteristic) it is constant on orbits of Out(G); and 

(2) (monotone) it is nonincreasing under homomorphisms between groups. 

Of course the first property follows from the second. 

1.2. Exotic isometries. If G admits a large group of automorphisms, the charac- 
teristic property becomes very interesting. Perhaps the most interesting example 
is the case of a free group F; in this case, we obtain a natural isometric action 
of Out(F) on the normed space B^(F). In fact, the unit ball in the scl norm on 
B\ ' (F) is a polyhedron, and associated to every realization of F as tti(S) for S a 
compact, oriented surface, there is a top dimensional face its of the unit ball whose 
stabilizer in Out(F) is precisely the mapping class group MCG(5); see [SJ |5] for 
proofs of these facts. 

There are many natural realizations of MCG(5) and Out(F') as groups of isome- 
tries of geometric spaces. Inevitably, these spaces admit essentially no other isome- 
tries (up to finite index). For example, in the case of MCG(5) acting on Teichmuller 
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space, this is a famous theorem of Roy den [20j . In marked contrast to these exam- 
ples, our first main result is that the scl unit ball in Bi(F) admits an enormous 
number of exotic isometries, and in fact we show that a random homomorphism 
between free groups is almost surely an isometry for stable commutator length: 

Random Isometry Theorem 13.161 A random homomorphism p : Fk — » Fi 
of length n between free groups of ranks k, I is an isometry of scl with probability 
1 — 0(C(k,l)~ n ) for some constant C(k,l) > 1. 

Here a random homomorphism of length n is one which sends the generators of 
Fk to randomly chosen elements of Fi of length at most n. 

We remark that in [3j (Lem. 6.1) Bestvina-Feighn obtained partial results in the 
direction of this theorem. Explicitly, for any element w in a free group F, and for 
any other free group F' , they constructed many homomorphisms <p : F — » F' for 
which the commutator length (not the stable commutator length) of p>(w) in F' is 
equal to the commutator length of w in F. In fact, their technique implies (though 
they do not state this explicitly) that for each fixed w, a random homomorphism of 
length n has this property with probability 1 — 0(C(w)~ n ). However the constant 
C(w) they obtain definitely depends on w, and therefore they do not exhibit a single 
homomorphism which is an isometry for commutator length for all w simultaneously 
(in fact, our proof of the Isometry Theorem should be valid with scl replaced by cl, 
but we have not pursued this). 

A necessary condition for a homomorphism between free groups to be an isometry 
for scl is for it to be injective. However, if k > 3 then there are many injective homo- 
morphisms Fk — > Fi that are not isometries; we give two infinite classes of examples, 
namely Example 12.21 and Example 12.71 In fact, we show (Proposition 12.91) that if 
Fk — > Fi is an isometry, then the image of Fk is necessarily self-commensurating 
in F\\ i.e. it is not properly contained with finite index in any other subgroup. 
Of course, any injective homomorphism Fi — > Fi has self-commensurating image. 
Extensive computer evidence (and some theory) has led us to make the following 
conjecture: 

Isometry Conjecture 14. li Let (p : F2 — > F be any injective homomorphism from 
a free group of rank 2 to a free group F. Then p is an isometry of scl. 

1.3. Extremal fatgraphs and quasimorphisms. There is a duality theorem 
(Generalized Bavard duality; see [5] or [8] Thm. 2.79; also see j2[) relating stable 
commutator length to an important class of functions called homogeneous quasi- 
morphisms. If G is a group, a function <fi : G — > R is a homogeneous quasimorphism 
if it satisfies 4>(g n ) = n<f>(g) for every g G G, and if there is a least non-negative 
number D((f>) (called the defect) so that for all g,h £ G, there is an inequality 



The space of homogeneous quasimorphisms on G is a vector space Q(G). The 
subspace on which D vanishes is naturally isomorphic to H 1 (G;M.), and D defines 
a norm on Q/ H 1 making it into a Banach space. 

Generalized Bavard duality is the statement that for all chains J2^i9i ^ &i {G) 
there is an equality 
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Because Q/H 1 is a Banach space, for any chain V € Bf (G) there exists a (j> for 
which equality holds — i.e. for which scl(r) = cf>(T)/2D((p). Such a quasimorphism 
is said to be extremal for T. 

It is a fundamental problem, given T, to exhibit an explicit <f> which is extremal 
for r. There are essentially no examples of (hyperbolic) groups in which one knows 
how to answer this problem for more than a handful of chains T. Upper bounds on 
scl are obtained for (integral) chains V by exhibiting nT for some n as the oriented 
boundary of a homotopy class of map S — > K[G, 1) for some compact oriented 
surface S with no disk or sphere components, and using the inequality 

sc i ( D = mf =*w 

K ' S 2n 

(see [5] or [8], Prop. 2.10). A surface realizing scl(r) = ~x(S)/2n is said to be 
extremal for T. For T in B± (G) for an arbitrary group G, an extremal surface need 
not exist. However, for a free group F, it turns out that extremal surfaces always 
exist, and can be found by a polynomial time algorithm (this is the Rationality 
Theorem from [5]; the algorithm is implemented by the program scallop |llj). 
For any surface S and any homogeneous quasimorphism cj> there is an inequality 
— x(5)/2 > sc\(dS) > 4>(dS)/2D((j)). A surface S and a quasimorphism <f> certify 
each other as extremal (for dS) if this inequality is an equality; i.e. if —\(S)/2 = 
^(dS)/2D{4>). 

In a free group, extremal (and other) surfaces bounding chains nT are encoded 
combinatorially as labeled fatgraphs. The details of this labeling are explained in 
§ 13. 2\ but the idea is just that the oriented edges of the fatgraph Y are labeled by 
elements of F in such a way that changing the orientation inverts the label; and 
then the oriented boundary of a surface thickening S(Y) of the fatgraph determines 
a finite collection of cyclic words in F which should represent nT in B^ (F). 

Our second main result is that if we fix the topological type of a fatgraph Y, 
most labelings Y give rise to extremal surfaces, and moreover we can explicitly 
construct (from the combinatorics of Y) an extremal homogeneous quasimorphism 
Hy which certifies that S(Y) and Hy are extremal: 

Random Fatgraph Theorem 15.101 For any combinatorial fatgraph Y, if Y is 
a random fatgraph over F obtained by labeling the edges of Y by words of length 
n, then S(Y) is extremal for dS(Y) and is certified by the extremal quasimorphism 
Hy, with probability 1 — 0{C{Y ,F)~ n ) for some constant C(Y,F) > 1. 

This implies that for any integer to, most words w in F with cl(io) < m satisfy 
cl(w) = to and scl(w) = to — 1/2. 

To be useful in practice, it is important to have some idea of the size of the 
constants C(Y,F) arising in the Random Fatgraph Theorem. In § 15.61 we tabulate 
the results of computer experiments for F = Fi and for trivalent Y; the triva- 
lent hypothesis significantly simplifies the construction of Hy and the verification 
of the certificate. The constants that arise are reassuringly small, affirming the 
effectiveness of the Random Fatgraph Theorem. 

2. INJECTIVE ENDOMORPHISMS OF FREE GROUPS ARE NOT ALWAYS ISOMETRIC 

2.1. A question of Bardakov. If G is a group, and G' is its commutator sub- 
group, the commutator length of an element g £ G' (denoted cl(g)) is the least 
number of commutators in G whose product is g. 
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Bardakov ,1] asked the following question: 

Question 2.1 (Bardakov, [I] qn. 2). Let Lp : F — > F be an injective endomorphism 
of a nonabelian free group F. Does c\(g) — cl((p(g)) for all g £ F' ? 

The answer to Bardakov's question is no. We give two infinite families of exam- 
ples to substantiate this claim. The first family of examples use some facts from 
the theory of 3-manifold topology, and were inspired by a conversation with Geoff 
Mess. 

Example 2.2 (Complex of curves; [5], Ex. 4.44). Let H be a handlebody of genus 3. 
Let 7 be an essential simple closed curve in dH, dividing dH into two subsurfaces 
Si, S*2 of genus 1 and 2 respectively. The inclusions Si — > dH are necessarily tti- 
injective, though the inclusions Si — >• H are typically not. However, Dehn's lemma 
(see [14]) says that if Si — > H is not injective, there is an essential simple closed 
curve 7 in Si that bounds an embedded disk in H. 

The set of isotopy classes of essential simple closed curves in dH are the vertices 
of a graph C(dH) called the complex of curves. Two vertices are joined by an edge 
in this complex if and only if they are represented by disjoint curves in dH. If we 
declare that each edge has length 1, the graph C(dH) becomes a (path) metric space, 
with distance function d(-, ■). Let C(H) denote the subset of vertices consisting of 
essential simple closed curves in dH that bound disks in H. 

It is known ([TS], Thm. 2.7) that there exist pseudo-Anosov mapping classes -0 of 
dH so that for any a 6 C(dH) the iterates tp n (a) satisfy d(ip n (a),C(H)) ->■ oo. If (3 
is an arbitrary essential loop in S2 then d(/3, 7) < 1, since (3 and 7 = 9S*2 are disjoint. 
If V is as above, and n is such that d(ip n ('y),C(H)) > 2, then d(^ n (/3),C(H)) > 1 
for all essential simple closed curves /3 in S2. It follows from Dehn's lemma that 
the inclusion ^"(S^) — » H is 7Ti-injective. 

Let F = 7Ti(S , 2), a free group of rank 4, and let g € F' be the conjugacy class 
associated to the loop 952- A simple degree argument implies that cl(<7) ^ 1 and 
therefore c\(g) = 2. Let ip : F — > F be the endomorphism induced by the inclusion 
£2 — ► ip n {S2) ~> H composed with any injective homomorphism %x(H) — > F. Since 
the image of g in tti(H) is represented by dS\, this image is a commutator. Hence 

ch>(<?)) = i. 

2.2. Stable commutator length. If G is a group, and g £ G', the stable com- 
mutator length of g (denoted scl(<?)) is the limit scl(g) := \mi n ^ ) . 00 c\(g n )/n. Stable 
commutator length is a more interesting and subtle invariant than commutator 
length, and is connected to a broader range of mathematical subjects, such as hy- 
perbolic geometry, topology, symplectic dynamics, bounded cohomology, etc. See 
[8] for a systematic introduction. 

It is convenient to extend the definition of (stable) commutator length to finite 
formal sums of elements. Suppose gi are a finite collection of elements in G whose 
product is in G' . Define c\Q2 gi) to be the minimum of the commutator length of 
any product °f conjugates of the gi, and define SclQ^j) to be the limit of 

cl(%2 9i)/n as n — > 00. 

It is shown in [S], § 2.4 (also see [5], § 2.6) that scl extends to a pseudo-norm on 
-Bi(G), the vector space of real group 1-boundaries (in the sense of the bar complex 
in group homology), and vanishes on the subspace H spanned by chains of the 
form g n — ng for g e G, n £ Z and g — hgh~ x for g, h £ G (note that H includes 
all torsion elements). Thus scl descends to a pseudo-norm on the quotient space 
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B^ := Bi/H. When G is a Gromov hyperbolic group (for example, when G is 
free), scl defines a genuine norm on B±{G)\ this follows from [9], Thm. A' (the 
separation theorem). 

If G is a group in which (nontorsion) elements are not infinitely divisible, it is 
convenient to think of an element of B± as a (homologically trivial) finite formal 
real linear combination of primitive conjugacy classes. Such objects arise frequently 
in low-dimensional geometry, e.g. in the Selberg trace formula, or in Thurston's 
theory of train tracks. 

Definition 2.3. A homomorphism between groups ip : G —> H is isometric if 
scl G (r) = seiner)) for all T € Bf (G). 

Note that an isometric homomorphism between free groups is necessarily injec- 
tive. 

Example 2.4. Any automorphism is isometric. 

Example 2.5. An inclusion G — ► H that admits a section H — > G is isometric. 

Example 2.6. An endomorphism of a free group that sends every generator to a 
nontrivial power of itself is isometric ([7], Cor. 3.16). 

Our next family of examples depend on the main theorems of [6], and we refer 
the reader to that paper for details. 

Example 2.7 (Nongeometric covers). Let F be a free group, and let G be a finite 
index subgroup of F. Let i : G F denote the inclusion. A realization of a 
free group is a conjugacy class of isomorphism G —¥ 7Ti(E) where E is a compact, 
connected, oriented surface (necessarily with boundary) . Associated to a realization 
there is a well-defined chain <9£ 6 B^(G). Say that a realization G — > 7i"i(E) is 
geometric if there is a realization F %x (S) and a finite cover E — > S inducing 
i : G — s> F. For a geometric realization, i takes the equivalence class of the chain 
dT, to the class of the chain [F : G] ■ dS, and there are equalities: 

-x(E)/2 = scl G (<9£) = Bdjr(i„0E) = [F : G] • scl F (<9S) = — [F : G] ■ x(5)/2 

However, if G —¥ tti(E) is nongeometric, it is always true that there is a strict 
inequality 

scl G (<9£) > sc\ F (i*dY,) 
so that such i* are never isometric; see Proposition 12.91 below. 

Note if the rank of G is even, there are many nongeometric realizations for which 
9E is connected. This gives many negative examples to Bardakov's question, since 
if sc\((p(g)) < scl(g) for some element g and some (p, then necessarily c\(cp(g n )) < 
c\(g n ) for some n. 

Note that every finite index subgroup G of F does in fact admit nongeometric 
realizations; hence G — > F is never isometric. Such an inclusion can be further 
composed with another injective homomorphism to produce many examples. 

Definition 2.8. A finitely generated subgroup G < F is self-commensurating in F 
if there is no finitely generated subgroup E < F with G proper of finite index in E. 

We summarize this example in a proposition. 

Proposition 2.9. If G — > F is an isometric homomorphism between finitely gen- 
erated free groups, then the image of G is self-commensurating in F. 
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The proof of this proposition is somewhat technical, depending on the main 
results of [6]. However, as the proposition is not used elsewhere in the article, the 
reader who is not familiar with [B] may skip it. 

Proof. Let G — > E be a proper inclusion of finite index between finitely generated 
free groups. We show G — > E is not isometric, and therefore neither is G — > F. 

Let G — » 7Ti(S) be a nongeometric realization; i.e. E does not cover a realization 
of F. It is easy to see that every realization of a free group is extremal for its 
boundary; i.e. E is extremal for 9E in G, so sc1g(9E) = — x(E)/2 (for the definition 
of an extremal surface, look ahead to § 13.11) . Let H be a subgroup of G of finite 
index, normal in E. There is a realization H —> 7i"i(E) for some finite cover E of E, 
and E is extremal for 9E in H. Since H — > 7Ti(E) is geometric with respect to G, 
there is an equality sc1h-(<9E) = sc1g(<9E) = — y(E)/2. 

On the other hand, since G — > tt\ (E) is not geometric with respect to E, neither is 
H — > 7Ti(E). Since H is normal in E, there is some e £ E which acts by conjugation 
on H as an outer automorphism e* of H not in MCG(E). By [6] Thm. A the classes 
9E and e»<9E projectively intersect the interiors of different top dimensional faces 
of the scl norm ball of H, and therefore scl#(9E + e*<9E) < 2<9E. Since scl is a 
norm, there is an inequality 

sclff(V pF/ff e»<9E) 

scl B (aE) = , „ < scl H (5E) = scl G (5E) 

[E : H\ 

(see [8], Cor. 2.81) and we are done. □ 



An interesting special case of Example 12.71 is to take F = F% and G to be index 
2. Any realization G — > 7Ti(S) has sc1g(9E) = 1, and therefore any nongeometric 
realization produces an integral chain in B± (F) with scl < 1. Figured] is a his- 
togram showing the distribution of sc\p{i*dYj) on 7500 "random" realizations of 
G — > 7Ti(E) for a four-punctured sphere E. 



I . I 



jjjj,. 



Figure 1. Histogram showing distribution of scl^c'E) for 7500 
realizations of G 

This figure suggests the following conjecture: 

Conjecture 2.10 (interval conjecture). The set of values o/scl on integral chains 
in F>i (F2) contains every rational number in the interval [3/4, 1]. 



ISOMETRIC ENDOMORPHISMS OF FREE GROUPS 



7 



In fact, it is not known whether the set of values of scl on integral chains in any 
free group is dense in any interval, though it is known that this set is not discrete 
(see [7J, Thm. 4.7). 

Example 2.11. Let us look more closely at a single 2-parameter family. Let a,b,c 
generate an F$, and let ip : F$ — > F^ take a — > a, b — > b 2 , c — > b~ 1 ab. There is a Z 2 in 
Aut(i 7, 3 ) given by (to, n) : (a, b, c) — > (ab m , b, b n c). The image of the chain a + b + c + 
a~ 1 c~ 1 b~ 1 in B^ (P3) maps to a + b 2 + b~ 1 ab + a~ 1 b~ 1 a~ 1 b~ 1 . Precomposing with 
(to, n) produces the chain ab 2m + b 2 + b 2n ~ 1 ab + b~ 2m a~ 1 b~ 1 a~ 1 b~ 1 ~ 2n . Applying 
the automorphism a — > aB, b — > b of F2 to the image gives a chain which in Bi {F%) 
is equal to 

W m , n ■= ab 2 ^ 1 + ab 2 ^ 1 + b 2 + -2 6 -2"»-2« 

This is an example of a (2-parameter) surgery family, as defined in [7J. Computing 
s(to, n) :— scl(u>„ l; „) therefore reduces to the analysis of an explicit linear family of 
integer programming problems. Such problems are in general beyond the reach of 
computer experiments, but this particular family of examples is barely within reach 
of a rigorous analysis, and well within reach of a heuristic analysis, implemented 
by the program sssf |23j . 
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) for < n < to < 11 



Table Q] gives the value of s(m,n) for 0<n<TO< 11, and is included to give the 
reader an indication of the variety of values of sc\(ip m , n (a + b + c + a _1 c _1 6 -1 )) 
possible in even a simple family of nonisometric injections tp m ,n ■ F3 F-2. 

Examples 12.21 and 12.71 show that it is quite easy to construct injective homo- 
morphisms between free groups that are not isometric. However, we will show in 
§[3] that a random homomorphism between free groups is isometric, and we further 
conjecture (and provide evidence to suggest) that every injective endomorphism of 
a free group of rank 2 is isometric. 

3. HOMOMORPHISMS BETWEEN FREE GROUPS ARE USUALLY ISOMETRIC 

In this section we describe a certain small cancellation condition guaranteeing 
that a homomorphism between free groups is isometric. This condition is very sim- 
ilar to the condition C'(l/12) studied in small cancellation theory (see e.g. |16) . 
Ch. V), and is generic, in a sense to be made precise in the sequel. However, prov- 
ing that this condition suffices to guarantee isometry depends on some technology 
developed in the papers [5j[7], and a careful inductive argument. 
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3.1. Surfaces. If G is a group, let X be a K (G, 1). Conjugacy classes in G corre- 
spond to free homotopy classes of loops in X. 

Let gi € G be a set of elements, and let T : TJ i Sj — > X be a corresponding set 
of loops. A map of a compact, oriented surface / : S — >• X is admissible for T if 
there is a commutative diagram 

► S 



9/ 



/ 



and an integer n(S') for which 9/* [9S 1 ] = n(5 , )[JTj Sj 1 ] m H\. The map is monotone 
if — > Yii $1 is homotopic to an orientation-preserving cover (equivalently, if 
every component of (95 wraps with positive degree around its image). 



Lemma 3.1 ([8\, Prop. 2.74). Let g x , 



6e conjugacy classes in G, represented 

-x-(S) 



scl(^ 5i ) = inf 2n(5) 

where the infimum is taken over all surfaces S and all maps f : S 
for r. 



X admissible 



The notation x~{S) means the sum of Euler characteristics ^ i x(^i) taken over 
those components Si of S with x($i) < 0- By [5], Prop. 2.13 it suffices to restrict 
to monotone admissible surfaces. An admissible surface S is extremal if equality is 
achieved. 

3.2. Fatgraphs. In free groups, most admissible surfaces — and certainly all ex- 
tremal ones — can be represented in an essentially combinatorial way, that is conve- 
nient for small cancellation arguments. This combinatorial encoding is very similar 
to a method developed by Culler [12], though it is more or less equivalent to the 
theory of diagrams over surfaces developed by Schupp |21) . 




Figure 2. Part of a thickened fatgraph over F-i near a 3-valent vertex 

A fatgraph Y is a graph in which each vertex has valence at least 3, together 
with a cyclic ordering of the edges incident at each vertex. Such a graph can be 
thickened to a surface S(Y) (or just S is Y is understood) in such a way that Y 
embeds in S(Y) as a deformation retract (one also says Y is a spine in SiY)). A 
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fatgraph Y is oriented if S(Y) is oriented. In the sequel we assume all our fatgraphs 
are oriented. Note that x(F) = x(S(Y)). 

One can arrange for the deformation retraction S(Y) — >• Y to be locally injective 
on dS(Y). The preimages of the arcs of Y give dS(Y) a natural cellular structure, 
in such a way that arcs of dS(Y) map isomorphically to arcs of Y, and vertices of 
dS(Y) map to vertices of Y. Two arcs of dS mapping to the same edge of Y are 
said to be paired. 

A fatgraph Y over F is an oriented fatgraph in which each arc of dS(Y) is labeled 
with a reduced, nontrivial clement of F in such a way that paired arcs have labels 
which are inverse in F, and consecutive arcs (reading around dS) are reduced; see 
Figure [2] for part of a fatgraph over F2 near a 3-valent vertex (in this figure and 
elsewhere, we frequently adopt the notation A for a -1 and so on). For such a 
fatgraph, dS is labeled by a finite collection of cyclically reduced cyclic words in 
F, so we can (and do) think of the oriented boundary dS as an element of Sf (F), 
which we denote dS(Y). 

The basic fact we use is the following lemma, which is a restatement of 12J, 
Thm. 1.4 in the language of fatgraphs. Note that Culler proves his theorem only 
for surfaces with connected boundary, but his argument generalizes with no extra 
work (an equivalent statement, valid for surfaces with disconnected boundary, is 
also proved in [S], Lem. 3.4; also see [5] § 4.3 for a discussion and references). 

Lemma 3.2 (Culler [12], Thm. 1.4 (fatgraph lemma)). Let S be an admissible 
surface bounding a chain Y. Then after possibly compressing S a finite number of 
times (thereby reducing — x~(S) without changing dS) there is a fatgraph Y over 
F with S(Y) = S. 

In the sequel Y will usually denote an abstract (unlabeled) fatgraph, and Y will 
denote a labeled one. 

3.3. scl and word length. In a free group F with a fixed generating set, every 
element is represented by a unique reduced word, and every conjugacy class is 
represented by a unique cyclically reduced cyclic word. 

Define |T| = min^ \gi\, where | • | denotes word length in F, and the minimum 
is taken over all representatives Y = 9i of the class T in . Note that if we take 
each <7; to be primitive and cyclically reduced, and insist that no gi is conjugate to 
the inverse of some gj (in which case we could cancel gi and gj), then |T| = ^ Iff* I- 
In other words, any expression of r as ^ gi either satisfies |T| = ^ \gi\, or can be 
reduced in an "obvious" way. 

Lemma 3.3. Let Y be an integral chain in B^(F). Then scl(r) < |T|/2. 

Proof. In fact we prove the stronger statement that cl(r) < |T|/2. By the definition 
of commutator length of a chain, it suffices to prove this in the case that Y is a single 
word g 6 F'. This means that every generator x appears in g as many times as x~ l 
appears. Each such pair of letters can be canceled at the cost of a commutator, 
and the result follows. □ 

The bound in Lemma 13.31 is not sharp. With more work, we obtain a sharp 
estimate. The following lemma appeals at one point to a covering trick used in [7]; 
since the trick is not used elsewhere in this paper, we refer the reader to [7] for 
details. 
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Lemma 3.4. Let T be an integral chain in B^ (F2). Then scl(r) < |r|/8. 

Proof. Let r — ^ gi and by abuse of notation, suppose each gi is represented by a 
cyclically reduced word. Suppose without loss of generality that there are at most 
|r|/2 letters equal to one of a or A. After applying the automorphism a —¥ ab, b — >• b 
sufficiently many times, we obtain a new chain r' = hi with at most |T|/2 letters 
equal to one of a or A, but with no a 2 or A 2 in any of the cyclic words hi. 

Let Y be a fatgraph with dS{Y) = T', and let S — S(Y). We can decompose 
S into a collection of at most |T|/2 rectangles pairing up a's and A's, together 
with some subsurface S' with at most |T| corners, and edges alternating between 
segments of dS labeled by powers of 6, and edges corresponding to proper arcs in 
S. 

Counting as in [7] , each rectangle contributes to the "orbifold Euler characteris- 
tic" of S, and each corner of S' contributes —1/4. The total contribution is therefore 
at most |r|/4, so ~X~(S) < |r|/4 - X (S')- Now, it is possible that x(S') < 0, but 
since S' has boundary components labeled by elements of the abelian group (6), 
we can pass to a finite cover of S' and compress so that x(S') can be made "pro- 
jectively" as close to as desired; this is explained in detail in [7], § 3.3. Hence 
scl(r) = scl(r') < |r|/8, as claimed. □ 

In fact, it is not much more work to extend this Lemma to free groups of arbitrary 
finite rank. Let F be freely generated by x\,...,x n ] if T G B^(F), we denote by 
the number of times that Xi and x^ 1 appear in T. 

Proposition 3.5. With notation as above, we have an inequality 

, ,„x IT! — max,- |r|,- 

sci r) < l — 

4 

for any V G B?(F). 

Proof. Without loss of generality, we may assume that max, = |r| n . As in 
the proof of Lemma 13.41 we may cut out all rectangles corresponding to matched 
pairs of x\ and x^ 1 . What is left is an immersed subsurface S' of S. An essential 
immersed subsurface of an extremal surface is also extremal, by [6]. Consequently 
S' is extremal for its boundary L', which lies in B^ ({x2, . . . ,x n }). We therefore 
have the inequality scl(r) < scl(r') + |r|i/4. Repeating this argument n — 1 times 
yields 

sci(r) < sci(r") + |r|i/4 + ■ ■ ■ + |r| n _i/4 

where scl(r") = 0, since T" G Bf {{x n )). The proof follows. □ 

Example 3.6. The bound in Proposition [33] is sharp, which we show by a family of 
examples. We first recall the free product formula (jS], § 2.7), which says that if G\ 
and G2 are arbitrary groups, and gi G have infinite order, then sc1g 1 *g 2 (9192) = 
scl Gl (ffi) + scl G2 (5 2 ) + 1/2. 

Now, let F be freely generated by Xi, . . . , x n as above, and define 

w n = [xi,x 2 ][x 3 ,X4] ■ ■ ■ [z n _i,x n ] 

if n is even, and 

W n = [xi 1 X2]{xz,X i \ - ■ ■[x n -i,Xn~z]Xn~2X n -lX n X^\x n X n ^i 2 X~ 2 

if n is odd. 
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For each i, we have scl([xj, Xj+i]) = 1/2. Moreover, using scallop (|TT)) one can 
check that sc\(x n _2 x n-i x n x n-i x n x n-2 x n 2 ) = 1- The free product formula then 
shows that scl(io„) = (n — l)/2, so Proposition 13 . 51 is sharp for all n. 

3.4. Small cancellation condition; first version. A homomorphism between 
free groups is determined by the values of the generators, which can be taken to be 
reduced words. In this section and the next, we define combinatorial conditions on 
these words which guarantee that the homomorphism is an isometry of scl. 

For the sake of clarity, we first discuss a severe condition which makes the proof 
of isometry easier. Then in § 13.51 we discuss a weaker condition which is generic 
(in a certain statistical sense, to be made precise) and which also implies isometry, 
though with a slightly more complicated proof. 

Definition 3.7. Let A be a set, and let F(A) be the free group on A. Let U be a 

subset of F(A) with U n U~ l = 0, and let S denote the set U U U~ l . We say that 
U satisfies condition (SA) if the following is true: 
(SA1) if x, y £ S and y is not equal to x , then xy is reduced; and 
(SA2) if x, y £ S and y is not equal to x or x , then any common subword s of 

x and y has length strictly less than |x|/12; and 
(SA3) if x € S and a subword s appears in at least two different positions in x 
(possibly overlapping) then the length of s is strictly less than |x|/12. 
Let B be a set, and tp : B — > U a bijection. Extend tp to a homomorphism 
tp : F(B) — > F(A). We say tp satisfies condition (SA) if U satisfies condition (SA). 

Note that except for condition (SAl), this is the small cancellation condition 
C(l/12). We will show the following: 

Proposition 3.8. Let tp : F(B) — > F(A) be a homomorphism satisfying condition 
(SA). Then ip is an isometry of scl. 

Condition (SAl) for ip means that if g is a cyclically reduced word in F(B), then 
the word in F(A) obtained by replacing each letter of g by its image under tp is also 
cyclically reduced. This condition is quite restrictive — in particular it implies that 
\A\ > \B\, and even under these conditions it is not "generic" — but we will show 
how to dispense with it in § 13.51 However, its inclusion simplifies the arguments in 
this section. 

Example 3.9. The set {aa,bb} satisfies (SAl). The set {ab, ba} satisfies (SAl). 

Suppose tp : F(B) — > F(A) satisfies condition (SA), and let Y be a fatgraph with 
dS(Y) in the image of tp, i.e. such that dS(Y) is a collection of cyclically reduced 
words of the form <p(g). By condition (SAl), each tp(g) is obtained by concatenating 
words of the form <p( x± ) for x S B. We call these subwords segments of dS(Y), as 
distinct from the decomposition into arcs associated with the fatgraph structure. 

Definition 3.10. A perfect match'm Y is a pair of segments tp{x), ^(x -1 ) contained 
in a pair of arcs of dS(Y) that are matched by the pairing. A partial match in Y is 
a pair of segments tp(x), ^(x^ 1 ) containing subsegments s,s _1 in "corresponding" 
locations in tp(x) and </?(x _1 ) that are matched by the pairing. 

The existence of a perfect match will let us replace Y with a "simpler" fatgraph. 
This is the key to an inductive proof of Proposition 13.81 The next lemma shows 
how to modify a fatgraph Y to promote a partial match to a perfect match. 
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Lemma 3.11. Suppose Y contains a partial match. Then there is Y' containing a 
perfect match with S(Y') homotopic to S(Y) and dS(Y) — dS(Y'). 

Proof. The fatgraph Y can be modified by a certain local move, illustrated in 
Figure 



X 



X 



X 



Figure 3. A local move to replace a partial match with a perfect match 

This move increases the length of the paired subsegments by 1. Perform the 
move repeatedly to obtain a perfect match. □ 

Remark 3.12. The move illustrated in Figure [3] actually occurs as the phenomenon 
of branch migration in molecules of DNA, especially in certain 4-valent junctions 
known as Holliday junctions. See e.g. |18j . 

Each vertex v of Y of valence |t>| contributes (|u| — 2)/2 to — x(Y), in the sense 
that ~x(X) — Tli V {\ v \ ~ 2)/2. Since each vertex v of Y is in the image of \v\ vertices 
in dS, we assign a weight of (\v\ — 2)/2|u| to each vertex of dS. 

Lemma 3.13. Let Y be a fatgraph with dS(Y) = tp(T) and suppose that ip satisfies 
(SA). Then either Y contains a partial match, or — x(Y") > |T|. 

Proof. Observe that dS(Y) decomposes into |T| segments, corresponding to the 
letters of T. Suppose Y does not contain a partial match. Then since each vertex 
contributes (\v\ — 2)/2|u| to — x(Y), it suffices to show that each segment of dY 
contains at least six vertices in its interior. 

Suppose not. Then some segment <p(x) of dY contains a subsegment s of length 
at least |c/?(x)|/6 that does not contain a vertex in its interior. Either s contains a 
possibly smaller subsegment s' which is paired with some entire segment f(y), or 
at least half of s is paired with some s _1 in some <p(y). In either case, since s is 
not a partial match by hypothesis, we contradict either (SA2) or (SA3). 

Thus each segment contributes at least 7 x ((3 — 2)/2 • 3) = 7/6 to — %(Y), and 
the lemma is proved. □ 

We now give the proof of Proposition 13.81 

Proof. Suppose (p : F{B) — > F(A) satisfies (SA) but is not isometric. 

Let Y be a fatgraph with dS(Y) = <p(T) so that scl(<p(r)) < - x (S(Y))/2 < 
scl(r) (the existence of such a Y follows from § 13.21 for instance, we could take 
Y to be extremal). We will construct a new Y' with dS(Y') = <f(T') satisfying 
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scl(y>(r')) < -x(S(Y'))/2 < scl(r'), and such that Y' is shorter than Y. By 
induction on the size of Y we will obtain a contradiction. 

By Lemma l3.3l and Lemma l3.131 Y contains a partial match, and by Lemma l3.11l 
we can modify Y without affecting dS(Y) or x(Y) so that it contains a perfect 
match. A perfect match cobounds a rectangle in S = S(Y) that can be cut out, 
replacing S with a "simpler" surface S' for which dS' is also in the image of ip. By 
Lemma T3.2( there is some surface S" with — x(<S"') < — x(&') an< i dS" — dS', and 
a fatgraph Y' with S(Y') = 5"'. 

In the degenerate case that S" is a disk, necessarily S is an annulus, and both 
boundary components of S consist entirely of perfect matches; hence T = g + g^ 1 
and scl(r) = scl(y(r)) = in this case, contrary to hypothesis. Otherwise dS" = 
dS' = ip(T') for some T', and satisfies - X {S{Y')) < ~x(S') = ~x(S(Y)) - 1. 

On the other hand, T can be obtained from T' by gluing on a pair of pants; hence 
scl(r) < scl(r') + 1/2. We have the following "diagram of inequalities" 
sc%(r)) < - X (S(Y))/2 < scl(r) 
VI Al 
sc%(r')) + 1/2 < - X (S(Y'))/2 + l/2 scl(r') + l/2 

from which we deduce scl(</?(T')) < — X (S(Y'))/2 < scl(r') as claimed. Since each 
reduction step reduces the length of dS(Y), we obtain a contradiction. □ 

3.5. Most homomorphisms between free groups are isometries. In this 
section we weaken condition (SA), allowing partial cancellation of adjacent words 
tp(x) and <p(y). Providing we quantify and control the amount of this cancellation, 
we obtain a new condition (A) (defined below) which holds with high probability, 
and which implies isometry. 

If two successive letters x, y in a fatgraph do not cancel, but some suffix of 
<p(x) cancels some prefix of <p(y), we encode this pictorially by adding a tag to our 
fatgraph. A tag is an edge, one vertex of which is 1-valent. The two sides of the 
tag are then labeled by the maximal canceling segments in ip(x) and p(y). If T is a 
chain, and Y is a fatgraph with dS(Y) equal to the cyclically reduced representative 
of <p(T), then we can add tags to Y to produce a fatgraph Y' so that dS(Y') is 
equal to the (possibly unreduced) chain p(T). 

Definition 3.14. Let A be a set, and let F(A) be the free group on A. Let U be a 

subset of F(A) with U n U^ 1 = 0, and let S denote the set U U U~ x . We say that 
U satisfies condition (A) if there is some non-negative real number T such that the 
following is true: 

(Al) the maximal length of a tag is T; and 

(A2) if x, y 6 S and y is not equal to x or x , then any common subword s of 
x and y has length strictly less than (|x| — 2T)/12; and 

(A3) if x £ S and a subword s appears in at least two different positions in x 
(possibly overlapping) then the length of s is strictly less than (\x\—2T)/12. 

Let B be a set, and <p : B — > U a bijection. Extend ip to a homomorphism 
ip : F(B) — > F(A). We say <p satisfies condition (A) if U satisfies condition (A). 

Notice that condition (SA) is a special case of condition (A) when T = 0. 

Proposition 3.15. Let ip : F{B) — >• F(A) be an homomorphism between free groups 
satisfying condition (A). Then ip is an isometry o/scl. That is, scl(r) = scl(^>(r)) 
for all chains T G B± (F(B)). In particular, scl(p) = sc\(ip(g)) for all g £ F{B)' . 
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Proof. The proof is essentially the same as that of Proposition 13.81 except that 
we need to be slightly more careful computing x(X). We call the edges in a tag 
ghost edges, and define the valence of a vertex v to be the number of non-ghost 
edges incident to it. Then —\(Y) = Yj V (\ v \ ~~ ^)/^ wnere the sum is taken over all 
"interior" vertices v — i.e. those which are not the endpoint of a tag. 

The proof of Lemma f3. 131 goes through exactly as before, showing that either Y 
contains a partial match, or ~x(Y) > To see this, simply repeat the proof of 
Lemma f3 . 1 31 applied to Y with the tags "cut off". Partial matches can be improved 
to perfect matches as in Lemma 13.111 Note that this move might unfold a tag. 

If Y is a fatgraph with dS(Y) = ip(T) and scl(^(r)) < - X (S(Y))/2 < scl(r), 
we can find a perfect match and cut out a rectangle, and the induction argument 
proceeds exactly as in the proof of Proposition 13.81 □ 

Fix k, I integers > 2. We now explain the sense in which a random homomor- 
phism from Fk to Fi will satisfy condition (A). Fix an integer n, and let Fi(< n) 
denote the set of reduced words in Fi (in a fixed free generating set) of length at 
most n. Define a random homomorphism of length < n to be the homomorphism 
cp : Fh —> Fi sending a (fixed) free generating set for Fk to k randomly chosen 
elements of F[(< n) (with the uniform distribution). 

Theorem 3.16 (Random Isometry Theorem). A random homomorphism ip : F% — > 
Fi of length n between free groups of ranks k, I is an isometry o/scl with probability 
1 — 0(C(k,l)~ n ) for some constant C(k,l) > 1. 

Proof. By Proposition 13.151 it suffices to show that a random homomorphism sat- 
isfies condition (A) with sufficiently high probability. 

Let Ui, ■ ■ ■ ,Uk be the images of a fixed free generating set for Fk, thought of 
as random reduced words of length < n in a fixed free generating set and their 
inverses for Fi. First of all, for any e > 0, we can assume with probability at 
least 1 — 0(C~ n ) for some C that the length of every u, is between n and (1 — e)n. 
Secondly, the number of reduced words of length en is (approximately) (2l — l) en , so 
the chance that the maximal length of a tag is more than en is at least 1 — 0(C~ n ). 
So we restrict attention to the ip for which both of these condition hold. 

If (A2) fails, there are indices i and j and a subword s of of length at least 
n(l — 3e)/12 > n/13 (for large n) so that either s or s _1 is a subword of Uj. The 
copies of s ± are located at one of at most n different places in m and in Uj] the 
chance of such a match at one specific location is approximately (21 — 1)~"/ 13 , so 
the chance that (A2) fails is at most k 2 n 2 (2l ~ l)-"/ 13 = 0(C~ n ) for suitable C. 

Finally, if (A3) fails, there is an index i and a subword s of Ui of length at least 
n/13 that appears in at least two different locations. It is possible that s overlaps 
itself, but in any case there is a subword of length at least |s|/3 that is disjoint from 
some translate. If we examine two specific disjoint subsegments of length n/39, the 
chance that they match is approximately (21 — i)-"/ 39 . Hence the chance that (A3) 
fails is at most kn 2 (2l - l)-«/ 39 = 0(C-") for suitable C. Evidently C depends 
only on k and I. The lemma follows. □ 

Corollary 3.17. Let k,l>2 be integers. There are (many) isometric homomor- 
phisms ip : Fk — > Fi . 

Lemma 3.18. Let F be a finitely generated free group. The following holds: 
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(1) if there are integral chains Ti^Ts in B± (F) such that scl(Fj) = U, then 
there is an integral chain T in (F) with scl(r) = t\ + fa; and 

(2) if there are elements 51,32 in F' such that scl(<7,) = t\, then there is an 
element g £ F' with scl(g) = ti + <2 + 1/2. 

Proof. Let F\,F% be copies of F, and let Oi'.F — > Ft be an isomorphism. Then in 
case (1) the chain <7i(Ti) + (72^2) in F± * F2 has scl equal to t\ +fa, and in case (2) 
the element o\ (31)0-2(32) has scl equal to t\ + fa + 1/2; see jS], § 2.7. Now choose 
an isometric homomorphism from F\ * F2 to F, which exists by Corollarv l3.17l □ 

Corollary 3.19. Let F be a countable nonabelian free group. The image of F' 
under scl contains elements congruent to every element o/Q modi,. Moreover, the 
image of F' under scl contains a well-ordered sequence of values with ordinal type 
uj u . 

Proof. These facts follow from Lemma 13.181 plus the Denominator Theorem and 
Limit Theorem from [7]. □ 

4. ISOMETRY CONJECTURE 

Conjecture 4.1 (Isometry conjecture). Let ip : F% — > F be any injective homo- 
morphism from a free group of rank 2 to a free group F. Then ip is isometric. 

Remark 4.2. Since free groups are Hopfian by Malcev |17j . any homomorphism 
from F2 to a free group F is either injective, or factors through a cyclic group. 
Furthermore, since F2 is not proper of finite index in any other free group, every 
F2 in F is self-commensurating, and therefore no counterexample to the conjecture 
can be constructed by the method of Proposition 12.91 

Since any free group admits an injective homomorphism into F%, and since scl 
is monotone nonincreasing under any homomorphism between groups, to prove 
Conjecture 14.11 it suffices to prove it for endomorphisms <p : F2 — > i 7 ^- 

Remark 4.3. In view of Example 12.71 rank 2 cannot be replaced with rank 3 in 
Conjecture 14.11 

Conjecture 14.11 has been tested experimentally on all cyclically reduced homo- 
logically trivial words of length 11 in F2 , and all endomorphisms F% — > F2 sending 
a — > a and b to a word of length 4 or 5. It has also been tested on thousands of 
"random" longer words and homomorphisms. The experiments were carried out 
with the program scallop f|llj). which implements the algorithm described in [5] 
and [8]. 

In order to give some additional evidence for the conjecture beyond the results 
of § 13.51 we prove it in a very specific (but interesting) case for which the small 
cancellation conditions (SA) and (A) do not hold. 

Proposition 4.4. The homomorphism ip : F% — > F2 defined on generators a, b by 
<p(a) = abA, tp(b) = b is an isometry. 

Proof. The proof is by induction, following the general strategy of the proof of 
Proposition 13.81 and Proposition 13.151 but with a more complicated combinato- 
rial argument. As in the proof of those propositions, we assume to the contrary 
that there is some T and a fatgraph Y with dS(Y) = ip(T) so that scl(y(r)) < 
— x{S(Y))/2 < scl(F). If we can find a partial match in Y, then we can cut out 
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a rectangle and get a simpler fatgraph Y' and a chain r" so that scl(y?(r')) < 
—x(S(Y'))/2 < scl(r'), and we will be done by induction. We show now that such 
a partial match must exist. 

Note that each consecutive string a m in T gives rise to a string of the form ab m A 
in r', and each b m in T gives rise to a string of the form b m . We call copies of b or 
B in ip(T) of the first kind fake, and copies of b or B in <p(T) of the second kind real. 
Every b (real or fake) must pair with some B (real or fake) in Y. If a real b pairs 
with a real B, or a fake 6 with a fake £?, then we obtain a partial match, which can 
be improved to a perfect match by Lemma 13.111 and then cut out, completing the 
induction step. 

So we assume to the contrary that there are no partial matches, and every real 
b pairs a fake B and conversely. Assume for the moment that T has no subwords 
that are powers of the generators (these are called abelian loops in |7], and we use 
this terminology in what follows). Then each string of real 6's or B's in tp(T) is 
followed by a and preceded by A, whereas each string of fake 6's or B's in (p(T) is 
followed by A and preceded by a. Moreover, each a is followed by a fake b or B, 
and preceded by a real b or B. These facts together imply that each a or A in <p(T) 
is contained in an edge of length exactly 1. From this we obtain a lower bound 
on —x(S(Y)), as follows. If T contains n segments of the form a m and n of the 
form b m , then (assuming there are no abelian loops), there are exactly n edges of 
Y which pair a single a with an A. Removing these edges leaves a fatgraph with no 
1-valent edges, since a edges are never adjacent at a vertex. Hence each such edge 
contributes at least 1 to — %, and we obtain the inequality — x(S(Y))/2 > n/2. 

However, we claim that the form of T implies that scl(r) < n/2, contrary to 
hypothesis. This shows that there is a partial match after all, and therefore Y can 
be simplified. But by induction this shows that no such T and Y can exist, and the 
proposition will be proved. 

The inequality scl(r) < n/2 follows easily from the method of [7] (in fact, the 
stronger inequality scl(r) < [n — l)/2 (achieved for T = abAB) is true, but we 
do not need this). In § 3 of that paper, it is shown that for V of the desired 
form, scl(r) = minj, e yn/2 — (ka{u) + Ks(y))/2, where ka and kb are certain 
piecewise linear non-negative functions, and y ranges over a certain rational convex 
polyhedron Y. The desired inequality (and the proof) follows, ignoring abelian 
loops. 

Each abelian loop of T reduces the count of a edges in Y by 1, but ([5], p. 9) 
also reduces the upper bound on scl(r) by 1/2. In other words: 



Example 4.5. In [7] § 4.1 it is shown that scl(a m + B m + aBA m+1 b m+1 ) = {2m - 
l)/2m for to > 2. Under ip, the image of a m and B m cancel, and one obtains 
the identity scl([a, b] [a, B m+1 ]) = (2m — l)/2m for to > 2. This family of words 
is discussed in [8] § 4.3.5 and an explicit collection of bounding surfaces exhibited. 
Proposition 14.41 certifies these surfaces as extremal. 

Example 4.6. The homomorphism if arises naturally as the inclusion of Fi as a 
factor in Fqo, the first term in a short exact sequence — > F2 — > Z, where the 



sci(r) 



minn/2 — #{abelian loops}/2 — (nA{y) + Ks(y))/2 



so the desired inequality holds in this case too. 



□ 
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F2 — > 7L kills one of the generators. It is not true that inclusions of bigger factors F n 
in Foo are isometric. For example, scl([o, 6][c, d]) = 3/2, but scl([a, a b ][a b ,a b ]) = 1. 

5. LABELINGS OF A FATGRAPH ARE USUALLY EXTREMAL 

In this section, we show that for an arbitrary topological fatgraph Y, a random 
labeling of its edges by words of length n is extremal for its boundary with probabil- 
ity 1 — C~ n . Notice that such a labeling defines a random groupoid homomorphism 
from the edge groupoid of Y to a free group F. Such a groupoid homomorphism 
in turn induces a homomorphism from iti(Y) to F, but such a homomorphism will 
never satisfy property (A) if Y has more than one vertex, since the generators of 
7Ti(y) necessarily map to words in F with big overlaps, corresponding to common 
subedgcs of Y. 

One significant feature of our construction is that the proof that a typical labeling 
Y of Y is extremal comes together with a certificate, in the form of a (dual) extremal 
quasimorphism. Producing explicit extremal quasimorphisms for given elements is 
a fundamental, but very difficult problem, and as far as we know this is the first 
example of such a construction for "generic" elements (in any sense) in a hyperbolic 
group. 

The construction of the extremal quasimorphism dual to a "generic" fatgraph 
is somewhat involved; however, there is a special case where the construction is 
extremely simple, namely that of trivalent fatgraphs. Therefore we first present the 
construction and the proofs in the case of trivalent fatgraphs, deferring a discussion 
of more general fatgraphs to § 15.51 

5.1. Quasimorphisms. Recall that if G is a group, a quasimorphism is a function 
(j> : G — y R for which there is a least non-negative number D(<f>) (called the defect) 
so that for all g, h S G there is an inequality 

\4>(gh) - 4(g) - 4>{h)\ <D(<f>) 

A quasimorphism is further said to be homogeneous if it satisfies 4>(g n ) = n<j)(g) for 
all g e G and all integers n. 

If 4> is an arbitrary quasimorphism, its homogenization 4> is defined to be the limit 
4>(g) '■= lim rl ^oo 4>(g n )/n. It is a fact that 4> with this definition is a homogeneous 
quasimorphism, with D(<j>) < 2D(<f>). See [8], § 2.2. 

Rhcmtulla [TH], and then later Brooks [3], gave an elementary construction of 
quasimorphisms on free groups, which we refer to as counting quasimorphisms. For 
a word w £ F, define the big counting function C w by the formula 

C w (v) = number of copies of w in v 

Then H w = C w — C w ~i is a quasimorphism, called the big counting quasimorphism 
for w. The function H w counts the difference between the number of copies of 
w and of u> -1 in a given word, and its homogenization H w counts the difference 
between the number of copies in the associated cyclic word. For such functions one 
has D(H W ) = 2D(H W ). 

Following Epstein-Fujiwara |13j . we define a variant on this construction as 
follows. For a given set S C F, denote by S~ x the set of inverses of elements of S, 
and define the small counting function cs by 

cs(^) = maximal number of disjoint copies of elements of S in v 
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So for example, ci a t, t b Qj t,b) (abba) = 2. Define hs '■= cs — c$-i to be the small 
counting quasimorphism for S. 

A significant property of small counting quasimorphisms (by contrast with the 
big counting quasimorphisms) is that there is a universal bound on their defect, 
which (except in rare cases) is sharp. 

Lemma 5.1. For any S C F, we have D{h$) < 3 and D(hs) < 6. 

Proof. There is a standard method to estimate defect of counting quasimorphisms 
and their variants, which we describe. First, note that hs is antisymmetric, i.e. 
hs(w) = — /is(u> _1 ) for all w E F. This is a property that will be shared by all the 
quasimorphisms we consider in the sequel. 

Now, given any g, h there are reduced words k, I, m so that the words kL, IM, mK 
are all reduced, and represent gh, g^ 1 ,h~ l respectively. We think of the words 
k, I, m as the labels on the incoming edges on a tripod Y (thought of as an especially 
simple kind of fatgraph) and observe that the oriented boundary dS(Y) = kL + 
IM + mK. Since hs is antisymmetric, it suffices to compute hs(kL + IM + mK). 

We refer to the 3-valent vertex of the tripod as the junction. By the definition of 
small counting functions, if k, L and kL are all reduced words, then < cs(kL — k — 
L) < 1, since any collection of disjoint S- words in k and L produces such a collection 
in kL not crossing the junction, whereas any collection of disjoint ^-words in kL 
contains at most one that crosses the junction. Symmetrizing, \hs(kL — k — L)\ < 1. 
But then we can compute 

\h s (kL + lM + mK)\ = \h s {kL-k-L) + h s {lM-l-M) + hs(mK-m-K)\ < 3 
Homogenizing multiplies the defect by at most 2, and the lemma is proved. □ 

It is the sharpness of this estimate that will allow us to use small counting 
quasimorphisms to calculate scl exactly. 

5.2. Labeling fatgraphs. We use the notation Y for an abstract (unlabeled) fat- 
graph, and Y for a labeling of Y by words in F; i.e. a fatgraph over F (see § 13.21) . 
A labeling of length n is a reduced labeling for which every edge of Y is a word of 
length n. 

By our convention, boundary words in dS(Y) must be cyclically reduced. For a 
labeling in which boundary words are not reduced, one can "fold" adjacent canceling 
letters to produce tags as in § 13.51 One can then either cut off tags, or think of 
them as "ghost" edges to be ignored. Note that folding in this sense is a restricted 
kind of folding in the sense of Stallings [22] , since the folding must respect the cyclic 
ordering of edges incident to a vertex. Hence a fatgraph which is completely folded 
(equivalently, for which dS(Y) is cyclically reduced) is not a priori 7ri-injective. 

5.3. The vertex quasimorphism construction. In this section, we construct a 
(counting) quasimorphism on F from a fatgraph Y over F. We will call this the 
vertex quasimorphism of Y. We will see that this vertex quasimorphism is typically 
extremal for dS(Y). 

Define a set ay on a labeled fatgraph Y over F as follows: every boundary 
component of S(Y) decomposes into a union of arcs, and each arc is labeled by an 
element of F. Between each pair of arcs is a vertex of dS(Y) (associated to a vertex 
of Y) . For each vertex of Y and each pair of incident arcs with labels u and v (u 
comes into the vertex; v leaves it), decompose u and v into u = U\U2, v = V\V2, 
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where usually we expect u\ and 112 to each be approximately half the length of u, 
and similarly for v\, «2, v, and add the word U2V1 to the set ay. There is some 
flexibility here in the phrase "about half the length" which will not affect our later 
arguments; in fact this flexibility indicates possible other constructions, in which 
the pieces have different sizes, bounded length, etc. 

A vertex quasimorphism for Y is a small counting quasimorphism of the form 
h aY . See Figure |4] for an example. In this figure, cry is the set 

ay = {bbAb, aBAA, aaaa, AbAA, AbaB, BBaB} 

Note that we have not broken the edges exactly in half, or even in the same place 
on either side. 




Figure 4. The vertex quasimorphism construction on a thrice- 
punctured sphere. 



Lemma 5.2. If no element of o Y x appears in the boundary dS(Y), then there is 
an inequality h aY (dS(Y)) > \ v \> where the sum is taken over all vertices v, and 
\v\ is the valence of the vertex v. 

Proof. Note that since the components of dS(Y) are cyclic words (rather than 
words), it only makes sense to apply the homogenized functions c and h to them. 
Since no element of ay appears in dS(Y), we have c CT -i (dS(Y)) = 0, so 

h aY (dS(Y)) = c~a Y {dS(Y)). For every vertex of Y and for each incident edge, we 
have a word in ay. By construction, these words do not overlap in the boundary 
chain dS(Y), so the value of c aY (dS(Y)) is at least as big as J2v M- ^ 

Remark 5.3. Note that it is possible for a strict inequality in Lemma 15. 2\ since 
there may be many different ways to put disjoint copies of elements of ay in dS(Y). 
However, if Y is trivalent and ay satisfies the hypotheses of the lemma, then there 
is an equality h aY (dS(Y)) is equal to three times the number of vertices of Y. 
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5.4. Trivalent fatgraphs are usually extremal. We say that a fatgraph Y over 
F satisfies condition (SB) if there is a choice of ay as above so that no element of 
cry 1 appears in dS(Y). 

Lemma 5.4. If a trivalent labeled fatgraph Y satisfies condition (SB), then both 
S(Y) and h aY are extremal for the boundary dS(Y), and certify each other. 

Proof. For a trivalent graph, h aY (dS(Y)) > 3V, where V is the number of vertices, 
by Lemma 15.21 By Bavard duality, and Lemma 15.11 there is an inequality 

sci (as y > — > > - 

2D(h aY ) AD(h aY ) 4 

On the other hand, since Y is trivalent, the number of edges is 3V/2, so x(S(Y)) — 
— V/2. Hence we get a chain of inequalities 

scl{dS(Y)) > 3F - > ~ = > sc\(dS{Y)) 

K y " - 2D{h aY ) ~ 4 2 - \ \ n 

Hence each of these inequalities is actually an equality, and the lemma follows. □ 

We now show that condition (SB) is generic in a strong sense. Given Y, we are 
interested in the set of Y with dS( Y) reduced obtained by labeling the edges of Y 
by words of length at most n. For each n, this is a finite set, and we give it the 
uniform distribution. 

Proposition 5.5. For any combinatorial trivalent fatgraph Y, if Y is a random 
fatgraph over F obtained by labeling the edges of Y by words of length n, then S(Y) 
is extremal for dS(Y) and is certified by some extremal quasimorphism h„ Y , with 
probability 1 - 0(C(Y, F)~ n ) for some constant C(Y,F) > 1. 

Proof. The constant C(Y, F) depends only on the number of vertices of Y. We 
make use of some elementary facts about random reduced strings in free groups. 

If we label the edges of Y with random reduced words of length n, it is true 
that there may be some small amount of folding necessary in order to obtain a 
fatgraph with dS(Y) cyclically reduced. However, the expected amount of letters 
to be folded is a constant independent of n, which is asymptotically insignificant, 
and may be safely disregarded here and elsewhere for simplicity. 

Now consider some element w of <jy under some random labeling. The fatgraph 
Y over F will satisfy condition (SB) with the desired probability if the probability 
that w~ x appears (as a subword) in dS(Y) is C~ n , because the number of elements 
of try is fixed (note that we are using the elementary but useful fact in probability 
theory that the maximum probability of a conjunction of extremely rare events is 
well approximated by assuming the events are independent). 

If u> _1 appears in dS(Y), then at least half of it must appear as a subword of 
one of the edges of Y, so the probability that w^ 1 appears in dS(Y) is certainly 
smaller than the probability that the prefix or suffix of w of length n/2 appears as 
a subword of an edge of Y. Let k denote the number of edges of Y. The proba- 
bility that a subword of length n/2 appears in a word of length n is approximately 
(n/2)r&nk(F )~™/ 2 , so, as we must consider each edge and its inverse, the probabil- 
ity that w _1 appears is smaller than 2/c(n/2)rank(F) - ™/ 2 . By replacing rank(F) by 
a slightly smaller constant, we may disregard the (n/2) multiplier, and the lemma 
is proved. □ 
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5.5. Higher valence fatgraphs. For fatgraphs with higher valence vertices, the 
construction of a candidate extremal quasimorphism is significantly more delicate. 

For to > 3 let K m be the complete graph on to vertices. Label the vertices 
0, f , 2, • • • , to — f . Define a weight w m on directed edges (i, j) of K m by the formula 
w m (i, i + k) = 3 — (6/s/to) where indices are taken mod to. 

Lemma 5.6. The function w m (i,i + k) := 3 — (6fc/m) is the unique function on 
directed edges of K m with the following properties: 

(1) It is antisymmetric: w m (i,j) — —w m (j,i). 

(2) It satisfies the inequality \w. m {i,j)\ < 3 — 6/to for all distinct i,j. 

(3) For every distinct triple i,j,k, there is an equality W m (i,j) + w m (j,k) + 
w m (k,i) = ±3 where the sign is positive if the natural cyclic order on k 
is positive, and negative otherwise. 

(4) It satisfies w m (i,i + 1) = 3 — 6/to for all i. 

Proof. Only uniqueness is not obvious. If we think of simplicial 1-cochain 

on the underlying simplicial structure on the regular to — 1 simplex then condition 
(3) determines the coboundary Sw m , so w m is unique up to the coboundary of a 
function on vertices. But condition (4) says this coboundary is zero. □ 

For x a reduced word in F, parameterize x proportional to arclength as the 
interval [—1, 1], and let x[—t, t] denote the smallest subword containing the interval 
from x{— t) to x(t). Fix some small e > and define the stack function S x to be 
the following integral of big counting functions: 



The e correction term ensures that the length of the shortest word in the support 
of S is at least e\x\. If a; is quite long, this word will also be quite long, and ensure 
that there are no "accidents" in what follows. The constant e we need is of order 
l/(max l) \v\); we leave it implicit in what follows, and in practice ignore it. 

Remark 5.7. The function S x is actually a finite rational sum of ordinary big count- 
ing functions, since x[— t, t] takes on only finitely many values. We can make it into 
a genuine integral by first applying the (isometric) endomorphism ip m to F which 
takes every generator to its mth power, and then taking lim„ woo <PmS<p m {x) m pl ace 
of S x . However, this is superfluous for our purposes here. 

We are now in a position to define the quasimorphism Hy . 

Definition 5.8. Let Y be a fatgraph over F, and suppose that every edge has 
length > 2n. For each vertex v, denote the set of oriented subarcs in dY of length 
n ending at v by Xi(v), where the index i runs from to |i>| — 1 and the cyclic order 
of indices agrees with the cyclic order of edges at v. Denote the inverse of Xi(v) by 




Xi{v). 



Then define 



h y = Y1 ( 3 -( Qk /\ V \))( S Xi(v)X i+h (v)-S x . +k ( v ) Xi (v)) 



v i,i+k%\v\ 



(note that the factor 3 — (6fc/|t>|) is i + k) from Lemma \5.6\i . 
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Let a denote a word of the form x%{v)Xj(v) or its inverse. In other words, 
the a are the words in the support of Hy. Now say that Y satisfies condition 
(B) if, whenever some o~[a, b] appears as a subword of some other a', or some 
a [a, b] or its inverse appears twice in a, then (b — a) is not too big — explicitly, 
(b — a) < 6/4(max 1 , \v\). Hereafter we denote 6 := 6/4(max t , |v|). 

Lemma 5.9. Suppose Y satisfies condition (B). Then D{Hy) < 3. 

Proof. Condition (B) says that if two distinct a, a' overlap a junction on one side of 
a tripod, then S a , S a * each contributes at most 5 to the defect. So we can assume 
that on at least one side, there is a unique a = Xi(v)Xj(v) with a subword of definite 
size that overlaps a junction. Again, without loss of generality, we can assume that 
the junction is at a(t) where t G [— 1 + S, 1 — 5]. By condition (B), if a' on another 
side has a subword of definite size that overlaps the junction, it either contributes at 
most (5, or else we must have a' = Xk{v)Xi{v) or a' — Xj(v)Xk{v). So the only case 
to consider is when the three incoming directed edges at the junction are suffixes 
of Xi(v), Xj -,(v), Xk(v) of length l>s><>u>0 respectively. But in this case the 
total contribution to the defect is u(w\ v i (i, j)+w\ v t (J, fc)+iui„| (k, i)) + (t—u)w\ v \(i,j). 
Since |ioi i + W\ v i (J, k) + w\ v t(k, i)\ — 3 and («, | < 3, this defect is < 3, 
as claimed. □ 

Theorem 5.10 (Random fatgraph theorem). For any combinatorial fatgraph Y, if 
Y is a random fatgraph over F obtained by labeling the edges ofY by words of length 
n, then S(Y) is extremal for dS(Y) and is certified by the extremal quasimorphism 
Hy. with probability 1 — 0(C(Y,F)~ n ) for some constant C(Y,F) > 1. 

Proof. The argument is a minor variation on the arguments above, so we just give 
a sketch of the idea. 

It suffices to show that a random Y satisfies condition (B) with probability 
I — 0(C~ n ) for some C. But this is obvious, since the Xi(v) are independent, and 
for any constant k > 0, two random words in F of length n do not have overlapping 
segments of length bigger than nn, and a random word of length n docs not have a 
segment of length bigger than nn that appears twice, in either case with probability 
l-0{C~ n ). □ 

Remark 5.11. Since x(S(Y)) e Z, the boundary 8S(Y) satisfies scl(8S(Y)) e |Z. 
On the other hand, Theorem 15.101 does not imply anything about the structure of 
scl for generic chains of a particular length. A random homologically trivial word 
(or chain) in a hyperbolic group of length n has scl of size 0(n/ logn) (see [10]), so 
a random homologically trivial word of length n conditioned to have genus bounded 
by some constant, will be very unusual. 

In fact, computer experiments suggest that the expected denominator of scl(ui) 
is a proper function of the length of a (random) word w. 

There are only finitely many distinct combinatorial fatgraphs with a given Eulcr 
characteristic, so if we specialize Y to have a single boundary component (recall 
this depends only on the combinatorics of Y and not on any particular immersion), 
then then we see that for any integer m there is a constant C depending on m so 
that a random word of length n conditioned to have commutator length at most m 
has commutator length exactly m and scl = m — 1/2, with probability 1 — 0(C~"). 
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Figure 5. A plot of — log of the failure rate for random labelings 
of lengths between 4 and 11, plotted for each trivalent fatgraph 
with four vertices. Each dot represents 500,000 trials. The fat- 
graphs themselves are arranged left to right, top to bottom in 
decreasing order of — log of failure at length 11, so the tripod in 
the lower right is the "hardest" to find vertex quasimorphisms for. 
The pictures were created using wallop 24 . 

5.6. Experimental data. Our main purpose in this section is to give an experi- 
mental check of our results and to estimate the constants C(Y, F). However, it is 
worth mentioning that vertex quasimorphisms provide quickly verifiable rigorous 
(lower) bounds on scl. 

5.6.1. Fast rigorous lower bounds on scl. Although not every chain admits an ex- 
tremal surface which is certified by a vertex quasimorphism, it happens much more 
frequently that a vertex quasimorphism certifies good lower bounds on scl. For 
example, if Y is not trivalent, a quasimorphism of the form h aY will never be ex- 
tremal; but if the average valence of Y is close to 3, the lower bound one obtains 
might be quite good. 

Because verifying condition (B) requires only checking the (non)-existence of cer- 
tain words as subwords of the boundary dS(Y), plus a small cancellation condition, 
it is possible to certify the defect of a vertex quasimorphism in polynomial time. 
This compares favorably to the problem of computing the defect of an arbitrary 
linear combination of big counting quasimorphisms (or even a single big counting 
quasimorphism) for which the best known algorithms are exponential. 

Example 5.12. It is rare for (short) words or chains to admit extremal trivalent 
fatgraphs. A cyclic word is alternating if it contains no a ±2 or 6 ±2 substring; for 
example, baBABAbaBabA is alternating, with scl = 5/6. An extremal fatgraph 
for an alternating word necessarily has all vertices of even valence, since the edge 
labels at each vertex must alternate between one of and one of b^. 

5.6.2. Experimental calculation of constants C(Y,F). While certifying a vertex 
quasimorphism is easy, finding one is much harder. To verify our asymptotic results, 
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Figure 6. A plot of — log of the failure rate for labelings of the 
fatgraph (circles) , and — log of the failure rate after acting by many 
random homomorphisms (squares). 



we can be content with breaking the edges of the fatgraph into uniform pieces and 
checking whether condition (B) is satisfied. However, for a given fatgraph, it might 
be the case (and usually is) that while a naive assignment of words for Hy fails, a 
more careful choice succeeds. To check whether there is any vertex quasimorphism 
is (naively) exponential, and this makes large experiments difficult. 

However for trivalent fatgraphs, condition (SB) on ay is much simpler. In partic- 
ular, whether or not a collection of words satisfies (SB) depends only on the (local) 
no-overlap condition, plus the "constant" condition of certain words not appearing 
in dS(Y). This makes this a priori infeasible problem of checking whether there 
is any vertex quasimorphism for a particular fatgraph possible with the use of a 
"meet-in-the-middle" time-space tradeoff. 

Using this method, we can experimentally estimate the best possible constants 
C(Y,F), at least in the case of trivalent Y. Figure [5] shows some data on the 
likelihood that a random labeling of a trivalent fatgraph with four vertices ad- 
mits a vertex quasimorphism. The linear dependence of — log(P(fail)) on label 
length is evident. We can calculate a best fit slope and y-intercept for these lines, 
which gives a best fit line of 1.47336n — 1.42772, or equivalently, P(success) = 
1 - 4.16918(4.36387)-™. Note that the lower right graph is the least likely to admit 
a vertex quasimorphism; this is heuristically reasonable, since self-loops at vertices 
handicap the graph by forcing a shorter length on some words in ay . A best fit for 
this line yields P(success) > 1 - 82.3971(3.19827)"™. 

5.6.3. Using homomorphisms to improve success rate. When a particular labeling Y 
does not admit a vertex quasimorphism, it might still be possible to find an extremal 
quasimorphism by applying a homomorphism <f> to Y . If (the folded fatgraph) <j)(Y) 
admits an extremal vertex quasimorphism H^y), and folding does not change the 
Euler characteristic of the fatgraph, then the quasimorphism tf^H^ry) ^ s extremal 
for dS(Y). 
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Because the edges of 4>(Y) are no longer random (and in particular, distinct 
edge labels will necessarily share long common subwords), it is not clear that ap- 
plying a homomorphism will affect our success rate. In fact, it turns out to help 
significantly, especially for shorter labelings. Figure [6] shows — log of the failure 
rate for a particular fatgraph compared with — log of the failure rate after applying 
many random homomorphisms. We decrease the probability of failure by a factor 
of about 5. Interestingly, changing the length of the homomorphism or the number 
of homomorphisms that we try does not seem to significantly alter our success with 
this procedure. 
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